An Empirical Examination of Challenges in Chinese Parsing
نویسندگان
چکیده
Aspects of Chinese syntax result in a distinctive mix of parsing challenges. However, the contribution of individual sources of error to overall difficulty is not well understood. We conduct a comprehensive automatic analysis of error types made by Chinese parsers, covering a broad range of error types for large sets of sentences, enabling the first empirical ranking of Chinese error types by their performance impact. We also investigate which error types are resolved by using gold part-of-speech tags, showing that improving Chinese tagging only addresses certain error types, leaving substantial outstanding challenges.
منابع مشابه
An improved joint model: POS tagging and dependency parsing
Dependency parsing is a way of syntactic parsing and a natural language that automatically analyzes the dependency structure of sentences, and the input for each sentence creates a dependency graph. Part-Of-Speech (POS) tagging is a prerequisite for dependency parsing. Generally, dependency parsers do the POS tagging task along with dependency parsing in a pipeline mode. Unfortunately, in pipel...
متن کاملCultural Differences Encountered by a Novice Chinese Immersion Teacher in an American Kindergarten Immersion Classroom
The research objective of this study was to explore the cultural differences and challenges encountered by the Chinese Immersion Teacher (CIT) and how the CIT deal with the cultural differences in the immersion classroom. A qualitative case study approach was chosen for this research. The participant was a novice kindergarten immersion teacher who was born and educated in a Chinese-speaking cou...
متن کاملCultural Differences Encountered by a Novice Chinese Immersion Teacher in an American Kindergarten Immersion Classroom
The research objective of this study was to explore the cultural differences and challenges encountered by the Chinese Immersion Teacher (CIT) and how the CIT deal with the cultural differences in the immersion classroom. A qualitative case study approach was chosen for this research. The participant was a novice kindergarten immersion teacher who was born and educated in a Chinese-speaking cou...
متن کاملبررسی مقایسهای تأثیر برچسبزنی مقولات دستوری بر تجزیه در پردازش خودکار زبان فارسی
In this paper, the role of Part-of-Speech (POS) tagging for parsing in automatic processing of the Persian language is studied. To this end, the impact of the quality of POS tagging as well as the impact of the quantity of information available in the POS tags on parsing are studied. To reach the goals, three parsing scenarios are proposed and compared. In the first scenario, the parser assigns...
متن کاملThe Challenges of Parsing Chinese with Combinatory Categorial Grammar
We apply Combinatory Categorial Grammar to wide-coverage parsing in Chinese with the new Chinese CCGbank, bringing a formalism capable of transparently recovering non-local dependencies to a language in which they are particularly frequent. We train two state-of-the-art English parsers: the parser of Petrov and Klein (P&K), and the Clark and Curran (C&C) parser, uncovering a surprising perf...
متن کامل